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1.  INTRODUCTION 


Program  LIBLIST  is  one  of  two  FORTRAN  programs  which  is  used  to  maintain  the 
computerized  cataloging  system  of  the  document  center  of  the  Directorate  of  Aero- 
space Studies.  It  produces  catalog  listings  from  any  data  base  which  is  maintained 
by  its  companion  program,  ADM,  discussed  in  volume  I of  this  report.  LIBLIST  is 
written  specifically  for  the  CDC  6600  computer.  Hence,  it  is  designed  about  a 60- 
bit  computer  word,  six  bit  characters,  and  a particular  64  member  printer  charac- 
ter set.  These  program  peculiarities  will  not  represent  much  difficulty  to  an 
experienced  programmer  desiring  to  adapt  the  program  to  another  large  scale  machine. 
Additionally,  the  program  should  be  easily  converted  from  FORTRAN  to  other  high 
level  languages  provided  suitable  bit  manipulation  capabilities  can  be  provided. 

This  volume  is  designed  to  provide  the  degree  of  understanding  required  to  run 
and  modify  the  program  and  to  make  computer  or  language  conversions  possible. 

Section  2 provides  a general  discussion  of  the  program,  including  an  introduction 
to  a vocabulary  which  is  helpful  in  explaining  the  overall  techniques  of  manip- 
ulating the  data.  Section  3 describes  program  input.  Appendix  A provides  defi- 
nitions for  major  program  variables,  while  the  program  listing,  interspersed 
liberally  with  comments,  is  presented  in  appendix  B. 
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2.  GENERAL  PROGRAM  DISCUSSION 


The  data  base  is  the  total  of  all  information  available  to  the  computer  about 
library  holdings.  It  is  maintained  on  a mass  storage  file  which  is  referred  to  in 
LIBLIST  as  NEWFIL.  The  information  on  NEWFIL  is  organized  by  document,  each  doc- 
ument being  defined  by  at  most  20  descriptors.  The  sum  total  of  information  for  a 
document  is  called  a data  set.  The  individual  descriptors  are  called  data  items. 

The  data  items  in  use  at  present^  are  given  in  table  1 with  descriptive  remarks. 

Data  items  may  contain  multiple  distinct  entries.  This  results  in  that  document 
appearing  under  each  entry  in  a listing  ordered  by  that  data  item. 

Through  inputs  the  data  items  to  appear  in  a listing  are  selected  and  the 
order  of  their  appearance  specified.  There  are  no  restrictions  on  which  data  i tem,s 
may  appear  in  a listing  or  their  order.  When  the  program  receives  this  information 
it  beni  an  exhaustive  examination  of  the  data  base,  document  by  document,  to 
COP'  the  listing.  As  an  example,  suppose  that  the  listing  is  to  be  by  key- 

w d by  several  other  data  items.  As  each  document  data  set  is  read  from 

base  it  is  examined  for  the  presence  of  a keyword  entry.  If  none  exists, 
no  further  processing  of  that  document  occurs.  If  a keyword  entry  is  present,  it 
is  examined  to  determine  if  it  is  single  or  multiple.  For  each  keyword  (up  to  a 
maximum  of  ten)  an  entry  for  the  listing  is  prepared.  Each  of  these  entries  differs 
only  in  the  keyword,  hence  its  eventual  placement  in  the  catalog  listing.  These 
entries  are  called  character  strings. 

Groups  of  these  character  strings  are  prepared,  with  approximately  500  per 
group.  As  each  group  is  completed,  it  is  ordered  alphanumeri cal ly.  Alphanumeric 
ordering  is  essentially  a well  defined  extension  of  alphabetic  ordering.  It  is 
discussed  in  detail  in  section  2.2.  After  ordering  a group,  it  is  written  on  mass 
storage  to  await  further  processing.  When  all  data  sets  from  the  data  base  have 
been  processed  in  this  manner,  there  exists  one  or  more  of  these  ordered  groups  of 
character  strings  which  must  be  combined  to  form  the  one  ordered  group  which  com- 
prises the  catalog  listing.  This  combining  is  done  through  a merging  process 


1.  Other  categories  of  information  can  easily  be  substituted  for  those  given  in 
this  report.  See  volume  I,  section  1. 
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Abstract  This  category  is  not  currently  used. 

Not  used 


similar  to  that  used  to  order  the  subgroups.  The  process  is  described  in  section 
2.3.  When  the  final  ordered  list  is  created  it  must  be  further  processed  (section 
2.4)  to  alter  its  appearance  for  the  actual  printout.  An  example  of  the  final 
result  is  presented  in  figure  1. 

2.1  DATA  BASE 

The  collection  of  all  aocument  information  available  to  LIBLIST  on  file  NEWFIL 
is  the  data  base.  This  information  has  been  organized  by  document  into  data  sets, 
and  in  turn  the  data  sets  have  been  organized  into  the  data  items  listed  in  table  1. 
In  arriving  at  this  structure,  many  points  had  to  be  considered.  The  discussion 
which  follows  will  examine  those  considerations  from  the  point  of  view  of  the  data 
item,  the  data  set,  and  the  data  base,  respectively. 

a.  Data  Items.  Data  items  represent  categories  of  information,  e.g.,  title, 
personal  author,  keywords.  For  a given  document  some  of  these  categories  may  not 
be  represented.  There  may  be  no  personal  author,  for  example.  Some  categories 
may  have  multiple  entries  which  should  be  treated  independently  when  forming  a cat- 
alog listing.  An  obvious  example  is  keywords.  And,  whereas  the  title  for  one 
document  may  occupy  a single  computer  word,  that  for  another  document  may  require 
15  computer  words. 

To  handle  these  variations  the  data  item  was  conceived  as  having  an  indef- 
inite length,  and  as  having  a substructure.  A data  item  has  zero  length  if  there 
is  no  information  about  a document  in  that  category.  The  upper  limit  for  its  length 
is  indefinite.  (The  sum  of  the  lengths  of  all  data  items  in  a data  set  can  not 
exceed  54  computer  words  when  stored  in  the  data  base.)  Notice  that  this  applies 
equally  to  all  data  items.  The  substructure  of  a data  item  is  achieved  by  linking 
multiple  independent  components  together  with  plus  (+)  signs.  This  eliminates  the 
plus  sign  from  the  otherwise  usable  character  set. 

The  CDC  printer  characters  are  given  in  table  2 with  their  display  codes 
(octal  representation  in  the  computer).  Note  that  the  colon  is  not  available  for 


2.  ADM  and  LIBLIST  could  be  modified  to  increase  this  limit.  Tradeoffs  of  program 
lengths  and  running  times  are  involved. 
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use  even  though  it  is  a legitimate  character.  This  is  because  the  display  code 
of  the  colon  is  00,  which  means  that  in  the  alphanumeric  ordering  process  the  colon 
plays  the  role  of  a blank.  More  is  said  of  this  in  section  2.4.  The  plus  (■••)  sign 
is  also  reserved  as  previously  indicated.  Two  consecutive  colons  (blank  substitutes) 
may  not  appear  i. 'bedded  in  a data  item.  Two  consecutive  colons  are  the  indication 
of  tiie  end  of  a data  item.  Hence,  they  must  appear  at  the  end  of  every  data  item, 
even  if  an  extra  computer  word  must  be  added  to  provide  one  or  both  of  them.  The 
remaining  61  characters  are  available  to  be  used  in  any  manner  in  a data  item  with 
the  additional  exception  that  all  punctuation  characters  are  automatically  spaced 
at  the  time  ADM  creates  the  data  base.  Thus,  for  example,  commas  are  always  fol- 
lowed by  a colon  (blank  substitute),  but  never  preceded  by  one.  This  information 
is  given  in  table  3. 


Table  3 

PROGRAMED  PUNCTUATION  SPACING 

No  blank  precedes 

+ -*/  = ?)],.  ; S 
No  blank  follows 
([+-*/= 

A blank  precedes 

( c 

A bl  ank'  fol  lows 

) ] % . . ; $ 

Should  two  characters  with  conflicting  rules  be  adjacent  to  one  another,  that 
applying  to  the  second  character  will  prevail,  e.g.  ) and  - is  spaced  )-. 

Data  items  in  the  data  base  are  stored  in  display  code  and  each  data  item 
occupies  an  integral  number  of  computer  words.  Thus,  for  example,  when  Rand  Corpora- 
tion is  used  as  a corporate  author  it  would  be  stored  in  the  data  base  as  the  two 
computer  words 

2201 1 60400031 722201 7 1 2201241 1 1 71 600000000 
which  when  converted  to  characters  is 

RAND :C0RP0 1 RATION : : : : 

Notice  that  this  data  item  is  terminated  by  at  least  two  consecutive  "blanks,"  i.e., 
colons.  Were  the  corporate  author  Bigdome  Corporation,  then  it  would  appear  in 
character  representation  as: 
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BIGDOME:CO|RPORATION:  :::::::::: 

where  the  extra  word  of  "blanks"  has  been  added  to  provide  the  second  "blank"  of 
the  pair. 


Since  the  rules  discussed  above  apply  to  all  data  items,  all  data  items 
have  essentially  the  same  status.  This  equality  is  exactly  what  allows  catalog 
listings  to  be  ar*^itrary  with  respect  to  which  data  items  appear  and  the  order 
of  their  appearance. 


b.  Data  Set.  To  form  a data  set,  the  data  items  are  assembled  in  consecutive 
locations  in  the  data  base  in  the  order  given  in  table  1. 


Some  method  must  be  used  for  specifying  the  locations  of  each  of  the  data 
items.  The  method  selected  is  to  include  in  the  data  set  a table  giving  the  loca- 
tion of  the  first  word  of  each  data  item  (numbering  the  first  word  of  the  data  set 
1)  and  the  number  of  words  comprising  it.  This  has  been  done  by  packing  this 
information  for  the  20  data  items  into  five  computer  words,  four  data  items  to  a 
word.  Since  a CDC  6600  word  has  60  bits,  this  means  that  the  length  and  location 
information  for  a data  item  must  be  presented  in  15  bits.  The  first  six  are  used 
to  give  the  length;  the  last  nine  the  location.  This  is  shown  in  figure  2. 
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Figure  2.  Format  of  a Word  of  the  Length-Location  Table 


The  correspondence  between  data  item  and  computer  words  and  bit  numbers  is  given 
in  table  4.  The  maximum  expressable  length  of  a data  item  is  77  octal  (63  decimal) 
computer  words.  The  maximum  location  is  777  octal  (511  decimal)  words.  These 
limits  have  proved  more  than  adequate. 

The  data  set  is  completed  by  preceding  the  length-location  table  with  a 
single  computer  word  giving  the  total  length  of  the  data  set. 

The  final  data  set,  thus,  has  the  form  shown  in  figure  3. 
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HE  STRUCTUKE  OF  THE  LENGTH-LOCATION  TABLE 


Length-Location 

Table 


Data  Items 


Length 

of 

Data 

Set 
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1 1 1 — 

1 1 1 — 
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1 1 

1 1 — 

— 1 1 

1 1— H 

1 1 1 — 

1 1 

1 1 

-I h 


H 1 


Figure  3.  Format  of  a Data  Set 

c.  Data  Base.  The  data  base  is  composed  of  the  collection  of  all  data  sets. 

For  reasons  which  will  become  obvious,  it  is  necessary  to  have  these  data  sets 
ordered  in  some  manner  in  the  data  base.  This  is  brought  about  not  by  any  require- 
ment of  LIBLIST,  but  by  the  need  to  be  able  to  delete  or  modify  data  base  data  sets 
with  Program  ADM.  Suppose,  for  example,  that  100  data  sets  are  to  be  deleted  from 
a data  base  of  1000  data  sets  because  the  corresponding  documents  are  no  longer  part 
of  the  collection.  If  there  is  no  order  to  the  data  sets  in  the  data  base,  the  list 
of  data  sets  to  be  removed  will  have  to  be  searched  over  and  over  again  as  the  data 
sets  in  the  data  base  are  examined  one  by  one.  If,  however,  both  lists  are  in 
order,  each  list  will  need  to  be  examined  only  once  to  delete  the  100  data  sets. 

The  ability  to  order  the  data  sets  depends  upon  each  data  set  having  a 
unique  identifier.  For  microfiche  with  AD  numbers,  this  identifier  is  the  AD  number. 
For  all  other  documents  it  is  the  accession  number  assigned  to  it  at  the  Directorate 
of  Aerospace  Studies. 

2.2  THE  CONCEPT  OF  ALPHANUMERIC  ORDERING 

Ordinary  alphabetical  ordering  is  accomplished  by  comparing  two  words  (or  word 
groups)  beginning  with  the  left-most  characters  of  each  and  proceeding  character  by 
character  to  the  right  until  a difference  is  found.  The  two  corresponding  but  dif- 
ferent characters  then  determine  which  word  or  word  group  comes  first  by  which  is 

15 


"alphabetically  smaller"  according  to  the  sequence  blank,  A,  B,  C . . . Z.  Punctu- 
ation is  generally  ignored  in  determining  alphabetical  order. 

The  ordinary  extension  of  alphabetic  ordering  to  include  numbers  - alphanumeric 
ordering  - is  not  consistent  with  this  process.  Instead  of  beginning  with  the  left- 
most digits  of  the  two  strings  of  digits  being  compared  and  proceeding  character  by 
character  until  a difference  is  found,  the  numbers  are  compared  as  entities.  Thus 
nine  precedes  10,  a situation  which  everyone  expects.  This  is,  of  course,  a con- 
vention, and  one  that  does  not  lend  itself  as  efficiently  to  al phanumerical ly  com- 
puter ordering  as  another  which  will  be  described. 

For  efficiency  in  computer  ordering  it  is  desirable  to  allow  the  ordering  to 
proceed  in  the  nonconventional  way  of  comparing  numbers  exactly  as  letters  are  com- 
pared - character  by  character  from  the  left.  Using  this  scheme,  very  nonconven- 
tional ordering  takes  place,  e.g.,  10  precedes  9;  1000  precedes  11,  etc.  This  too 
is  a convention,  but  not  one  everyone  will  readily  adapt  to.  A partial  solution 

3 

is  to  preface  numbers  with  zero.  Thus  09  will  indeed  precede  10.  This  works  as 
long  as  the  two  numbers  being  compared  have  the  same  number  of  digits.  In  most 
cases  this  is  a practical  way  to  obtain  the  traditional  ordering. 

There  is  one  further  difference  between  traditional  alphanumeric  ordering  and 
the  computer  ordering  appearing  in  ADM.  Punctuation  marks  are  considered  in  ADM 
as  characters  just  as  are  the  blank,  letters,  and  the  digits  0-9.  The  alphanumeric 
order  of  all  the  characters  is  that  given  previously  in  table  2.  The  order  corres- 
ponds to  the  numerical  order  of  the  display  codes.  This  consideration  of  punctuation 
in  the  ordering  process  makes  consistency  of  punctuation  essential  if  listings  are 
to  have  the  desired  order  at  all  times.  Below  are  some  pairs  of  expressions  ordered 
traditionally  and  as  the  ADM/LIBLIST  programs  order  them. 


3.  Roman  numerals  provide  another  problem  in  computer  ordering.  For  example 
C(IOO)  will  order  before  I (1),  IX  (9)  before  V(5),  etc.  It  will  generally  pay  to 
replace  Roman  numerals  with  Arabic  numbers. 


Traditional 

ADM/LIBLIST 

1. 

BASE 

BASE 

BASES 

BASES 

2. 

ALPHA9  ALPHA09 

ALPHAIO  ALPHA09 

or 

or 

ALPHAIO  ALPHAIO 

ALPHA9  ALPHAIO 

3. 

1812 

18027 

18027 

1812 

4. 

01812 

01812 

18027 

18027 

5. 

STATISTICS,  VOL.  1 

STATISTICS,  VOL  2 

STATISTICS,  VOL  2 

STATISTICS,  VOL.  1 

These  ordering  characteristics  should  be  understood  and  planned  for  before 
the  first  input  data  set  is  ever  keypunched  for  program  ADM. 

2.3  ALPHANUMERIC  ORDERING  OF  A SET  OF  CHARACTER  STRINGS 

There  are  many  methods  for  computer  ordering  of  a set  of  alphanumeric  data. 
They  vary  widely  in  efficiency  and  ease  of  implementation,  often  as  a function  of 
the  nature  of  the  problem  (for  example,  the  initial  amount  of  order  present  in  the 
data).  The  character  strings  ordered  in  LIBLIST  usually  have  very  little  intrinsic 

4 

order.  A very  efficient  method  for  ordering  in  such  a case  is  based  on  merging 
as  suggested  by  the  "tournament  chart"  of  figure  4.  In  this  context  merging  is  the 
process  of  combining  two  ordered  lists  into  a single  ordered  list  by  repeatedly 
comparing  the  smallest  remaining  element  of  each  list  and  choosing  the  smaller  of 
these  as  the  next  element  of  the  combined  list. 

At  the  left  of  figure  4 are  represented  N (=  16  for  this  example)  subsets  of 
character  strings,  each  subset  containing  a single  character  string.  For  conve- 
nience N is  an  integral  power  of  2.  At  the  first  step  these  subsets  are  merged  by 


4.  An  exception  occurs  when  the  ordering  is  being  done  on  accession  number  or  AD 
number.  In  these  cases,  the  data  are  perfectly  ordered  as  extracted  from  the  data 
base. 
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pairs  (1  and  2,  3,  and  4,  . . . (N-1)  and  N)  to  produce  N/2  ordered  subsets  each 
with  two  character  strings.  At  the  next  step  these  subsets  are  again  merged  by 
pairs,  resulting  in  N/4  ordered  subsets  each  with  four  character  strings.  At  each 
step  of  the  ordering  process  the  number  of  character  string  subsets  decreases  by 
one-half,  while  the  size  of  the  individual  subsets  increases  by  a factor  of  two. 
Since  N = 2'^  (n  an  integer),  after  n steps  in  the  process,  there  remains  one 
ordered  subset  of  N character  strings. 

In  actual  practice,  each  subset  is  formed  only  in  terms  of  where  in  the  array 
of  character  strings  its  members  reside.  This  technique  is  very  efficient  when 
compared  with  actually  moving  the  character  strings  to  form  the  subsets  in  adjacent 
locations  in  the  computer. 

In  LIBLIST  N was  chosen  as  512.  Since  each  of  the  n steps  of  the  process 

Q n 

requires  handling  N strings  and  512=2  =2  , a maximum  of  Nxn  or  512  x 9 = 4608, 
comparisons  are  necessary  to  order  512  character  strings.^  A comparison  takes  only 
a few  microseconds  on  a CDC  6600  making  the  process  quite  fast. 

In  the  event  that  there  are  fewer  than  N character  strings  to  be  ordered,  the 
ordering  must  still  proceed  as  though  there  were  N character  strings.  In  this 
event,  dunriy  one-word  character  strings  are  used  to  complete  the  N entries  for 
ordering.  The  duimy  character  strings  always  order  at  the  end  of  the  final  ordered 
subset.  Thus,  it  takes  as  long  to  order  one  character  string  by  itself  as  N charac- 
ter strings.  This  drawback  is  of  little  practical  consequence,  given  the  overall 
speed  of  the  scheme. 

The  above  method  as  programmed  in  LIBLIST  requires  that  every  character  string 
in  the  set  being  ordered  reside  in  the  computer's  central  memory  during  the  ordering 
process.  This  generates  a storage  requirement  of  512  x 54  = 27648.|g  = 66000g  com- 
puter words  for  the  character  strings.  The  size  of  the  set  being  ordered  could  be 
doubled  and  still  fit  in  the  CDC  6600.  It  is  obvious  however  that  there  is  a limit 


5.  Nxn  is  an  upper  bound  to  the  number  of  comparisons  which  must  be  made,  but  not 
the  least  upper  bound.  In  merging  two  lists  comparisons  stop  as  soon  as  the  elements 
of  one  list  are  exhausted. 
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to  how  large  a set  may  be  ordered  at  one  time  in  central  memory.  This  problem  is 
circumvented  by  introducing  a merging  procedure  using  four  mass  storage  files. 
These  files  are  referred  to  in  the  program  as  TAPE!,  TAPE2,  TAPE3,  and  TAPE4.  At 
the  beginning  of  the  merging  procedure  the  character  strings  to  be  ordered  reside 
on  TAPEl  and  TAPE2  as  indicated  in  figure  5.  Each  of  the  indicated  subsets  is  a 
final  ordered  list  of  the  previously  described  process  for  ordering  512  character 
strings. 


First  Subset  of 

Ordered  Character  Third 

Strings  Subset  (M-l)st  Subset 

TAPEl  

End  of 
File 


Beginning  of 
File 


Second  Subset  Fourth  Subset  Mth  Subset 

TAPE2  

End  of 
File 


Beginning  of 
File 


Figure  5.  File  Structure  at  Beginning  of  Merging  Process 


If  there  are  an  odd  number  of  subsets  of  ordered  character  strings,  the  last  one 
will  be  on  TAPEl.  If  there  are  an  even  number,  the  last  one  will  appear  on  TAPE2. 

The  minimum  number  of  such  subsets  is  one;  the  maximum  is  currently  20.  The  alpha- 
numeric order  within  a subset  is  from  left  to  right  in  figure  5,  i.e.,  A is  to  the 
left  of  Z. 

The  first  merging  operation  involves  the  first  subsets  on  TAPEl  and  TAPE2. 

A character  string  from  each  is  read  and  they  are  compared.  The  alphanumerically 
smaller  is  written  on  TAPE3.  A new  character  string  is  read  from  the  appropriate 
file  to  replace  the  string  written  on  TAPE3,  and  the  process  of  comparison  is 
repeated.  This  is  done  until  both  subsets  of  character  strings  have  been  read  from 
TAPEl  and  TAPE2  and  written  on  TAPE3.  (When  one  of  the  two  subsets  has  had  all  its 
entries  read  and  merged,  the  remainder  of  the  other  subset  is  copied  to  TAPE3.) 

The  subset  on  TAPE3  contains  all  entries  of  the  subsets  from  TAPEl  and  TAPE2  and 
the  merging  has  properly  ordered  them.  The  next  merging  operation  repeats  the  proce- 
dure with  the  second  ordered  subsets  on  TAPEl  and  TAPE2.  The  only  difference  is 
that  the  merged  subset  is  written  on  TAPE4. 
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This  overall  scheme  is  pursued  until  all  subsets  from  TAPE!  and  TAPE2  have 
been  merged  by  pairs,  alternately,  on  TAPE3  and  TAPE4.  Odd  numbered  merging  opera- 
tions are  written  on  TAPE3;  even  on  TAPE4.  In  the  event  that  the  total  number  of 
subsets  in  TAPEl  and  TAPE2  is  odd,  the  final  subset  on  TAPEl  will  not  have  a cor- 
responding subset  on  TAPE2  with  which  to  be  merged.  The  merging  process  then  con- 
sists simply  of  copying  the  last  subset  on  TAPEl  to  TAPE3  or  TAPE4  as  appropriate. 
When  all  data  from  TAPEl  and  TAPE2  have  been  processed,  the  roles  of  TAPEl  and 
TAPE2,  and  TAPE3  and  TAPE4  are  reversed.  Character  strings  are  read  from  TAPE3  and 
TAPE4  and  merged  alternately  on  TAPEl  and  TAPE2. 

Obviously,  the  merging  of  two  subsets  reduces  the  total  number  of  subsets  by 
one.  The  logical  conclusion  of  the  above  process  is  one  ordered  subset  containing 
all  character  strings  for  the  listing.  A graphic  example  is  given  below. 

ORIGINAL  SITUATION 
Subset  1 Subset  3 Subset  5 

Subset  2 Subset  4 

AFTER  FIRST  MERGING  OPERATION 
■r^P£1  Subset  3 Subset  5 

TAPE2  -iykset_4_ 

j^P23  Subset  1 Subset  2 

AFTER  SECOND  MERGING  OPERATION 
TAPE!  -Subset  5 
jAp£3  Subset  1 -*•  Subset  2 

jAprA  Subset  3 Subset  4 

AFTER  THIRD  MERGING  OPERATION 
j^p^3  Subset  1 Subset  2 Subset  5 

TAPE4  Subset  3 -*•  Subset  4 

AFTER  FIRST  MERGING  OPERATION  AFTER  REVERSING  FILE  ROLES 
TAPE!  Subset  1 Subset  2 Subset  3 Subset  4 

TAPE3 
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TAPE! 


AFTER  SECOND  MERGING  OPERATION  AFTER  REVERSING  ROLES 
Subset  1 + Subset  2 + Subset  3 + Subset  4 


TAPE2  — -- 

The  roles  of  the  files  are  reversed  once  again,  and  each  file  reverts  to  its 
original  role.  The  final  merging  operation  results  in  TAPE3  containing  the  final 
ordered  collection  of  character  strings.  If  there  are  1 , 2,  5,  6,  9,  10  . . . 
subsets,  the  final  ordered  collection  of  character  strings  will  reside  on  TAPE3. 
Otherwise,  it  will  reside  on  TAPEl . The  final  phase  of  the  program  converts  the 
character  strings  to  a form  suitable  for  output  and  then  does  the  actual  output. 

2.4  OUTPUT  PROCESSING  OF  THE  ALPHANUMERICALLY  ORDERED  CHARACTER  STRINGS 

At  this  point  in  the  program,  the  character  strings  are  unsuitable  for 
printing.  First,  blanks  are  represented  as  colons.  These  must  be  converted  to 
blanks.  Second,  each  data  item  is  terminated  by  at  least  two  (at  most  11)  con- 
secutive colons.  These  must  be  converted  to  a suitable  "end  of  data  item"  marker 
to  serve  as  a separator  of  adjacent  data  items  in  the  printout.  These  two  proce- 
dures require  a character-by-character  examination  of  each  character  string.  This 
is  the  most  time  consuming  portion  of  the  program. 

A typical  character  string  if  printed  without  these  modifications  might 
look  like:^ 

SOC: : : : : : :ROCKET-BOOSTED:GLIDE:BOMB:DISPERSION:ANALYSIS: (U) : 
SANDIA:LAB0RAT0RIES :34707 : : : : : 

When  processed  it  appears  as: 

SOC  r*  rocket-boosted  GLIDE  BOMB  DISPERSION  ANALYSIS  (U)  SANDIA  LABORATORIES 
34707 


6.  The  CDC  6600  can  not  print  strings  of  colons  as  shown  using  the  logic  of'LIBLIST 
as  12  zero  bits  at  the  end  of  a word  are  taken  as  an  end-of-line  indicator. 
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The  actual  processing  involves  constructing  a new  character  string  from  the  old. 

As  each  character  is  examined  from  the  old  character  string  it  is  either  copied 
into  the  new  character  string,  replaced  (blank  for  colon),  inserted  (r^),  or  dis- 
carded (excess  colons).  When  the  conversion  is  complete,  the  new  character  string 
is  printed. 
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3.  INPUT  TO  PROGRAM  LI  BUST 


Card  input  to  Program  LIBLIST  is  used  to  specify  the  structure  of  data  base 
printouts.  This  is  done  through  use  of  the  ITEM  card  which  contains  three  fields. 
The  first  field  is  read  under  format  AlO  and  contains  ITEM  beginning  in  column  1. 
The  second  field  is  read  under  format  15  (columns  11-15).  The  number  in  this 
field  specifies  the  position  in  the  final  printout  of  the  data  item  specified  by 
the  number  read  from  the  third  field  under  format  FIO.O  (columns  16-25).  The 
data  item  is  identified  by  a number  from  1-20  as  given  in  table  1. 

To  request  the  program  to  produce  a list  consisting  of  document  title,  cor- 
porate author,  and  accession  number  would  require  the  three  cards 

ITEM  1 3.0 

ITEM  2 6.0 

ITEM  3 1.0 

plus  a card  to  signify  that  the  input  for  that  case  has  been  defined.  This  last 
card  contains  ENDCASE  beginning  in  column  1.  It  is  read  by  the  same  format  as  the 
ITEM  cards,  the  numeric  fields  being  ignored.  The  order  of  the  ITEM  cards  defining 
a case  is  immaterial.  However,  if  N (1  £ N £20)  data  items  are  being  requested  in 
a printout,  then  each  of  the  N numbers  specifying  positions  in  the  printout  should 
be  represented  exactly  once  (with  none  greater  than  N).  Multiple  cases  may  be 
stacked  in  any  run,  each  terminated  by  an  ENDCASE  card.  The  input  deck  should 
always  be  terminated  by  an  ENDJOB  card  (ENDJOB  beginning  in  column  1). 

In  addition  to  the  card  input,  a file  containing  the  data  base  must  be  sup- 
plied. This  file  is  produced  by  Program  ADM.  Its  structure  is  discussed  earlier 
in  this  volume  as  well  as  in  the  volume  I of  this  report,  which  describes  Program 
ADM.  This  file  has  the  logical  file  name  NEWFIL.  It  is  equated  to  TAPES  in  the 
program. 
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ENDATA 

ICA(I) 

ICB(I) 

ICT 

IDIR 

IMASK(I) 

INAD(I) 

I.NADS(I) 

IND(I) 


APPENDIX  A 

DEFINITIONS  OF  IMPORTANT  PROGRAM  VARIABLES 

ENDATA=1  indicates  that  all  data  sets  have  been  read  from  the 
data  base. 

An  array  which  contains  the  character  string  resulting  from  a 
read  from  unit  NFRDl . 

An  array  which  contains  the  character  string  resulting  from  a 
read  from  unit  NFRD2. 

A word  containing  one  six-bit  character  with  all  other  bits 
zero.  The  position  of  the  character  within  the  word  is 
dependent  upon  usage. 

IDIR  = -1  causes  data  being  merged  to  be  read  from  TAPEl  and 
TAPE2,  written  on  TAPE3  or  TAPE4.  IDIR  = 1 causes  data  being 
merged  to  be  read  from  TAPE3  and  TAPE4,  written  on  TAPEl  and 
TAPE2. 

For  1=1,  . . .,  10,  a mask  of  six  consecutive  bits  set  to  one 
(mill  or  77  octal),  occupying  bit  positions  (59  - (1-1)6)  to 
(54-(I-l)6).  For  I = 11,  IMASK(ll)  = IMASK(l). 

An  array  containing  the  indirect  addresses  of  the  character 
strings  in  the  ISORT  array  which  are  being  ordered. 

An  array  which  saves  the  INAD  array  for  the  next  step  of  the 
ordering  process.  After  the  ordering  process  is  complete 
INADS(I)=J  gives  the  value  of  J such  that  IS0RT(J,K)  is  the 
I-th  ordered  character  string. 

An  array  used  to  indicate  the  order  and  composition  of  the 
listing.  Order  is  given  by  I.  Composition  is  given  by  IND(I) 
according  to  the  implicit  numbering  of  data  items  specified  in 
table  1 . 
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lODATA(I) 

IPOS 

IPOSP 

IPRINT(I) 
ISORT(I,J) 
ISPACE 
I SUM 
I WORD 

n 

IIP 

ns 


Contains  the  most  recent  data  set  read  from  file  NEWFIL. 

A counter  which  gives  the  number  of  characters  contained  in  the 
resul t of  a character-by-character  examination  of  the  ^irst  data 
item  of  the  list  or  a character  string. 

A position  indicator  used  to  direct  the  placement  of  characters 
coming  from  a character-by-character  examination  of  the  first 
data  item  of  the  list  or  a character  string.  For  IP0SP=1,  .... 

10  bit  positions  (59-f IPOSP-1 )6)  to  54-( IPOSP-1 )6)  are  indicated. 

An  array  containing  the  version  of  a character  set  which  is 
printed  as  part  of  the  catalog  listing. 

The  array  where  the  character  strings  to  be  ordered  are  assembled. 
IS0RT(I,1)  gives  the  length  of  the  character  string. 

A counter  used  to  count  the  consecutive  colons  (blanks)  appearing 
in  a character  string.  ISPACE=2  indicates  the  end  of  a data  item. 

A running  count  of  the  number  of  computer  words  in  the  character 
string  being  assembled  in  the  ISORT  array. 

A counter  giving  the  number  of  the  word  of  the  IPRINT  array  being 
filled  as  a result  of  the  character-by-character  examination  of 
the  raw  character  string. 

The  location  in  the  INAD  array  of  the  indirect  address  of  the 
element  of  the  first  of  the  two  adjacent  subsets  of  the  ISORT 
array  being  meraed,  which  is  to  be  compared  with  an  element  from 
the  second  subset. 

The  location  in  the  INAD  array  of  the  indirect  address  (in  the 
ISORT  array)  of  the  last  element  of  the  first  of  the  two  adjacent 
subsets  being  merged. 

Saves  the  value  of  the  previous  II  for  the  calculation  of  a new  II. 
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The  location  in  the  INAD  array  of  the  indirect  address  of  the 
element  of  the  second  of  the  two  adjacent  subsets  of  the  ISORT 
array  being  merged,  which  is  to  be  compared  with  an  element  from 
the  first  subset. 

I2S  Saves  the  value  of  12  for  later  use. 

LENLIM  The  maximum  number  of  words  allowed  in  a character  string.  At 

present  LENLIM  = 50. 

LENLOC  (I,J)  LENLOC  (1,J)  contains  the  number  of  words  in  the  J-th  data  item. 

LENLOC  (2,J)  indicates  the  location  of  the  first  word  of  the 
data  item  in  the  lODATA  array. 

LF  (I,J)  For  TAPEI,  1=1  ,2,3,4,  and  the  J-th  subset  of  character  strings 

on  TAPEI;  LF  gives  the  length  of  that  subset. 

LIMUP  The  maximum  number  of  character  strings  which  may  be  ordered  at 

any  one  time.  At  present  LIMUP  = 512. 

LIMUPH  LIMUP/2 

LISTTOT  A running  count  of  the  number  of  character  strings  in  the  catalog 

listing  being  created. 

NENTRY  The  number  of  character  strings  to  be  formed  from  a given  data 

set. 

iNF(I)  Maintains  the  count  of  the  number  of  subsets  of  character  strings 

on  TAPEI,  I = 1 ,2,3,4. 

NFRDl  Equals  1 or  3 to  indicate  that  READ(NFRDl)  specified  reading  from 

TAPEI  or  TAPE3,  respectively. 

NFRD2  Equals  2 or  4 to  indicate  that  READ(NFRD2)  specifies  reading  from 

TAPE2  or  TAPE4,  respectively. 


NFTEST  An  indicator  used  in  selecting  the  value  of  NFWRT.  For 

NFTEST  = -1,  NFWRT  = 1 or  3,  as  appropriate.  For  NFTEST  = 1, 
NFWRT  = 2 or  4,  as  appropriate. 

NFWRT  Equals  1,2,3,  or  4,  to  indicate  that  WRITE(NFWRT)  specifies 

writing  on  TAPEl , TAPE2,  TAPE3,  or  TAPE4,  respectively. 

NITEM  The  number  of  data  items  to  be  included  in  the  listing. 

NLINE  The  number  of  lines  needed  to  print  the  character  string  just 

processed. 

NLIST  The  number  of  character  strings  currently  in  the  ISORT  array. 


NOREADl  Equals  0 or  1 to  indicate  that  the  subset  of  character  strings 

being  read  from  the  file  specified  by  NFRDl  has  or  has  not  been 
exhausted,  respectively. 

N0READ2  Similar  to  NOREADl,  except  application  is  to  NFRD2. 


Counts  the  number  of  character  strings  read  from  the  subset  of 
character  strings  being  read  from  unit  NFRDl. 

Similar  to  NRl , except  application  is  to  NFRD2. 

The  combined  number  of  subsets  of  character  strings  residing 
on  TAPEl,  TAPE2,  TAPF3,  and  TAPE4. 

NWTESTl  = 1 when  ICA  array  contains  a character  set  to  be  merged. 

NWTEST2  = 1 when  ICB  array  contains  a character  set  to  be  merged. 

At  the  I-th  stage  (1*1,  . . . , 9)  of  the  ordering  process,  N2 
gives  the  length  of  the  subsets  being  merged,  viz.,  2^^"^^. 
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APPENDIX  B 
PROGRAM  LISTING 


=nCGRA*i  ^ :Bl  IS'  I INPUT  .OUTPO'  .NE^F  IL  . TAPE  I . TAPE2.  'APE3  . 'APE-* . 'AFE5  = 
1NE«FI^) 


: CC'^MENTS  SET  CFP  9''  ASTEpISKS  APE  IN  fcpcE  FCP  ALL  S'ATEPEN'S 

C PCLwGUlNG  '«E  COMMENTS  UP  '0  'mE  NE”T  GPCUP  of  such  COMMENTS.  A 

C CCMmEN’’  pCLwCUEO  3t  a NLMEEP  IN  PAPENThESES  applies  ’’0  'hE 

C fcluCUING  STA'E'^ENTS  'HROUGH  the  statement  numbered  hI'H  'HA' 

0 number. 

0 program  UI3LIST  USES  'HE  FORTRAN  FUNCTIONS  MASKfll.  SHIFT(I.^). 

: AND  MCD(I.J).  MASK! I)  FCRMS  A MASK  OF  I ONES  IN  BIT  POSITIONS  3 

C T-POUGH  (I-ll.  BIT  POSITIONS  APE  NUMBERED  LEFT  TO  RIGHT  FROM  59  '0 
C 3.  SHIFT! I, JI  AS  USED  IN  THIS  program  (J  POSITIVE)  PESUL'S  IN  A 

C ..EF'  END-ARCUND  SHIFT  OF  THE  WORD  I BY  ^ BITS.  MOD(I.J)  CALCULA'ES 

: I ''COULD  j.  '«E  logical  .and.  and.  or.  operators  are  also  used. 

C 'MEY  DO  A BI ’■-BY-BIT  logical  SUM  OR  PRODUCT  OF  THE  QUANTITIES 

: 't-ET  CONNECT. 

C 

COMMON  .A/  INDIBO) ,NI TE  I 

DIMENSION  INADISIBI.  IODATA(IOO).  I SORT ( 5 1 2 . 50 ) . lENLOC ( 2 . =0 1 . LIS 
:’'^1I0.25).  INADS(512).  laiTdl),  IMASKllO).  IPRINT(IQO).  LF!-+.aO), 
2 NFim.  ICA(IOC).  ICB(IOO).  NFP(U) 

INTEGER  ENDATA 

Data  IBI  T- 60,  Sh.hB.hB.  36. 30.2>*.  I8. 12.6.0/ 

C 

Data  lENLIM/50.'  .LIMUP'512/  .MUL'-ST'  1 / 


DO  10  1*1,10 

IMASKI I 1»SHIFT(MASK(6) . IBlTl 1 ) ) 
10  CONTINUE 


initialization  required  for  EACH  catalog  listing 


20  CONTINUE 
call  INPUT 
REWIND  1 
REWIND  2 
REWIND  5 
IMUPH*L  lMUP/2 
lISTTOT-0 


t 


I 

II 
t 

I 

( 


} 
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£nca:a=o 

:3R=-i 
20  30  1 = 1 
20  30  J=1 .20 
L"i ! .J)=0 
30  CCNTINbE 

\r  ( 1 I =NF  ( 2 ) =NE  3 ) =NF  ( 1 =0 
N’T  = 0 


initialization  required  fop  each  subset  of  character  SI’RINGS 


^0  continue 
nlist=o 

do  50  1=1 .LIMUP 
DO  50  J-1 .LENLIM 
ISORTd  . J)  = 10H;  ; 

50  continue 


READ  A DATA  SET  FRCM  FILE  NEhFIl. 


60  continue 

READ  (5)  N, ( lOOATA(K) ,K»2,N) 
IF  (E0FI5) I 210.70 
70  continue 


FOR  The  data  set  just  read,  extract  FRCM  I0DATA(2)  through 
iooata(6i  The  locations  in  iooata  and  the  corresponding  lengths  of 
the  data  items  to  be  used  in  The  catalog  listing,  k is  used  to 

SPECIFY  WHICH  WORD  OF  the  ICDATA  ARRAY  CONTAINS  THIS  INFORMATION 
FOR  the  I-TH  data  item.  L SPECIFIES  WHICH  15  BITS  WITHIN  THAT  wCRD 
CONTAIN  THE  INFORMATION.  THE  SHIFT.  MASK.  AND  .AND.  FUNCTIONS  APE 
USED  TO  ISOLATE  THE  INFORMATION  WHICH  IS  STORED  IN  uENLOC  IN 
INTEGER  FORMAT. 


DO  80  K»1  .40 
LENL0C1K)»0 

00  continue 
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::  3C  : = ; ,s: 

^ ^ ~ . \o 


-£N=5r*:f'  :CCA'A  . ^-1  •;5  anc  “^ask  £. 

iCDA'A  i«-»!i,c*  -*'.  *.5  .ANC.'''A5^  9 
-£Ni-CC  : .■^L  =5H!r”  _E'j.6i 
-EnlCC  3.IVW  =9^ I F ' 1 LCC  . 3 ) 

3C  :cn’:n^e 

:c  NC'  FOCCE55  3A'a  SE'  IF  :^;=S*  I *£•-  3CE5  NC’  EC5 

;f  _eslcc I : . ;ndi ; I ) .eq. 0 1 go  tc  sc 


;f  *-e  •'OS’  fecen^LY  feaO  OA'A  SE'  is  'C  be  FEf«E5En:E0  in  *i-E 
Ca'AlOO  listing,  examine  the  FIRS”  OA'A  1 ’’EM  FCP  “LL'IP'-E  EN*FIE5. 
ANC  IF  FFESENT.  break  out  'i-E  NEN'Rv  entries  and  store  '-E*^  in 
-1ST*.  *t-(E  F'.uS  sign  SEPARA'ES  tle  multiple  entries,  if  anv. 


INriALIZE 

INITI AL=LENLCC(2. INDl 1 ) ) 

-as*»init:al»lEnloc( i . inoi i ) i-i 
IFCS=IPOSP=0 
OC  100  K=1  ,250 
lISTT.K1=0 
100  CONTINUE 
NENTRY= 1 

ANALYSE  the  first  data  ITEM  CHARACTER  BY  CHARAC’EP  . SIGNS 
DC  130  K^INriAL.LAST 
OC  120  L=! . 10 

ICT»SHIFT(  ICCA*A(KJ  . IBI  T(  I2-L1  I . AND  . ‘^ASK  ( 6 1 
IF  I ICT.NE . IL»1  go  to  110 
-EN=I IPOS-1 ) / 10*1 
IF  ( IPOSP.GT.0)  -EN»LEN»1 
L.ISTT.  NENTRV.  ; ) *lEN 
NEN’’Ry*NEN’’RY*  1 
IF  (NENTRY.GT. 10)  GO  TO  IMO 
1=05* IPOSP*0 
GC  ’’0  120 
110  CCN’’INUE 

I?0S*IPC5*1 

IPCSP- IPOSP»l 

IF  I IPCSP.EG. 1 1 ) IPCSP-1 

I’’*(IPCS-li.'10*2 

-ISTTINENTRy. 1T)«L15TT(NENTRY. IT).CR.3HIFT(ICT,I3IT| IPCSPl 1 

120  CCnyinuE 
130  CCN’INUE 
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1 : ;s5LPe  poof£s  'e^^^ina'icn  cf  et^' 

■j  wEN=i;pcs-n  ic-i 

- - : 5TT ( NEN’Oy . : ■ =^£N 

1 :'■  (L ;5tt(sentov 1 1 . 3 A^,c  =r-ir'  ''as><(  ;2- . ;= 

_ ;'RY._EN)  1 .£0.3)  l;5T*  nen"=’y . 1 i =l ! 5"  NE“r=y.;  -: 

• :'*0  CCNTlNtE 

;f  (\ENTfy.3’.  IC)  NEN^PY’^IO 


FQPr*  SENTRY  CHARAC’EP  S'^RINGS  FCR  Yi-E  ^A'A  SE * SE ! NG  PRCCESSEE. 
AND  STORE  them  !N  ISCRT.  ’’HiS  15  GONE  3Y  INSER^’iNG  'h£  nEN'P'' 
ENTRIES  !N  LiST’  BACK  !N’’0  iCCA'A  ONE  3Y  ONE.  AND  'HEN  _OCp:nG  ON 
NI'EM  TO  assemble  Oa'A  I'^EmS  IN  '--£  PROPER  CPCER. 


30  BOO  K*1 .NENTRY 
NL!ST-NLIST»1 

C FIRST  data  item  had  only  OnE  EN'Ry  . NO  NEED  TO  PL^  '.N'O  :C3A'4 

If  inentry .EQ. ! I GO  TO  160 
C PLACE  AN  entry  from  LIST'y  IN  ICDATA 

LAST*LISTT(<. 1 1 
LENLOCI 1 , IND( 1 ) )*LAST 
CO  150  L»1  .LAST 

IC0aTA(LENL0C(2, 1ND( I ) ) -L-n =LISTT IK .L* 1 I 
150  continue 
160  continue 

C assemble  the  character  string 

ISUM»1 

30  190  J-1 ,NI TEM 

len=lenlcc(  1 . inou)  ) 
lcc»lenloc  (2.  ind<  jn 

IF  1LCC.EQ.01  LEN*1 
IF  (LOC.NE.O)  GO  ’’O  170 

c DO  NOT  Allow  'HE  character  string  to  EYCE3E  l.ENl’.“  «cpcs 

IF  ( ISUM*1  .GT.LENLlM-n  GO  TO  190 
1 SORT  inlist. ISUM^l I-IOH;; 

1SUM«!SUM-1 
I SORT (NL 1ST. I )«ISUM 
GO  TO  190 
ro  CONTINUE 

C DO  NOT  ALLOW  the  character  S'RING  TO  EXCE3E  -ENLl^  «;P3S 

IF  ( ISUM*LEN.GT .LENL IM-n  LEN«LENL I M- I SUM- I 
IF  (LEN.lE.O)  go  to  190 
DO  180  M-l .lEN 

ISORTINLIST. ISUM»M).I0DATAIL0C*M-1 ) 

.1  180  continue 

!SUM-1SUM*lEN 
150RTt.NL  1ST.  1 !*1SUM 
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;5C  ccn":me 
^CQ  CONT" ! NUE 

C ' " 20  NCT  Al_Cu  \SC9''  apray  TC  OvERFLC^ 

IF  (~'tL’’LST.EC. : .and.nli5t.3e.l:y'lp-:c)  go  :c  3=2 

■Z  2CNTINL/E  NEXT  GATa  3E'' 

30  ”^0  50 

: OA’A  BA5E  EXhAUSTEO 

=:o  continle 
ENDa’a*  1 

: SUBSET  OF  character  strings  PEAQv  ■'0  BE  CPCEREO 

c20  CONTINUE 

l!STTOT*lISTTOT»NLIST 


ORDER  the  ISCRT  ARRAY  AUPHaNUHER I CALL y . T«E  RRCCECURE  DE=>ENCS  ON  0 
“ERGING 'CYCLES.  IN  THE  FIRST  CYCLE  CHARACTER  SPRINGS  ARE  ‘*E=GEO 
BY  PAIRS  - 1 AND  B.  3 AND  ’■* . ETC.  ■YQ  FCRH  ORDERED  PAIRS  OF 
character  STRINGS.  IN  the  SECOND  CYCLE  ThE  FIRS'  PAIR  IS  HERGED 
HlTh  THE  SECOND.  yhE  'HiRi'  i^ITH  THE  FOURTH,  E"^C.  TO  ~0P^  ORDERED 
OUADS  OF  character  STRING!  . THE  RESULT  OF  THE  9-'H  CYCLE  IS  A 
completely  ordered  set  of  character  strings,  yme  order  of  Tme 
character  strings  is  maintmned  in  the  inad  and  inads  arrays. 


initialize  inads 

DO  230  I«1 .LIMUP 
I NADS t I ) »1 
230  continue 
C begin  the  9 CYCLE  PROCESS 

do  370  1*1.9 
N2«2»*( I-! ) 

1 1S--N2«2+1 

DO  350  J*l .L1MUPH.N2 

I 1*1 1S*I 1S*N2*2 

12*I2S*Il*N2 

INITIAL*!  1 

:_ast*I2*n2-i 

L»2 

DO  3H0  K»INITIal.LAST 
2M0  CONTINUE 

IF  (K.GT. 12)  GO  TO  250 
250  CONTINUE 

IF  ( ISORTC INADSI I 1 ) .U .GE .0 . AND. ISCRTI INADSI 121 .L 1 . GE . 0i  GO  TO  260 
IF  ( ISORTl INADSI I I 1 .L) .LT .0 .AND. ISCRTI INADSI  12)  ,L1  .lT . 0)  GO  *0  260 
IF  ( ISCRTI INADSI I I 1 .D .GE.O)  GO  TO  270 
GO  TO  290 
260  CONTINUE 

IF  ( ISORTI INADSI I 1 I .Ll-ISORTI INADSI 12) ,L) 1 2"0.290.290 


ooooo  oo  o •"')  <>  oo 
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PLACE  TME  II  CHAPACTEP  string  BE'^CPE  'i-E  IE  Cwa  = aC’E=  S'^ING 

continue 

1NAD(K1 * INADSi I 1 ) 

I 1*1  1»1 
L = E 

IF  ( I 1 .EQ.  I2S)  GC  TO  300 
GO  ^0  3U0 

THE  l-th  words  of  the  character  strings  are  iden’icai..  compare 

the  IL»1 1ST  WORDS 

EBO  continue 

IF  fU.EQ.LENUIH-1 ) GO  TO  E90 

L»L*1 

GO  TO  2^*0 

PLACE  the  is  character  string  before  'HE  i;  character  S'RING 

E90  continue 

:nadik)«inad5( lEi 

IE-I2*1 

l=E 

IF  ( 12. GT. LAST)  GO  TO  320 
GO  TO  3>40 

THE  n character  string  list  is  exhausted 
300  continue 

KP-K 

DO  310  'l-IE.LAST 
<p.KP*l 

INADIKP) « INADSlM) 

310  continue 
GO  TO  350 

the  12  CHARACTER  Si’RING  LIST  IS  EXHAUSTED 
320  continue 
1 1P-I2S-1 
KP»K 

DO  330  M-l I , I IP 
KP«KP*1 

INAD(KP)»INADSIH) 

330  continue 
GO  TO  350 
3'*0  CONTINUE 

SAVE  "^HE  HOST  RECENT  ORDER  FOR  'HE  NEXT  CYCLE  OR  FOP  Th£ 
storage  operation  to  FOLLOW 
350  continue 

00  360  I L • 1 . L I HUP 

1 NADS ( IL)«1NAD( ID 
360  continue 

370  CONTINUE 


WRITE  The  character  STRINGS  ON  SCRATCH  haS5  STORAGE  IN  '«E  ORDER 
SPECIFIED  BY  INAOS.  IF  'HE  total  NUHBER  OF  SUBSETS  OF  CHARACTER 
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5"INGS  CBCE=E3  '^.!S  CA'A;.OG  -IS'ING  !E  CGC . 'APE;. 

!P  E>E“4,  »^B1*E  EN  aPE2. 


N 'P =N 'P » ; 

1 'PST.<rEST*N’P  2.0 
SP«PT»! 

;p  ( rES'.pQ.x'EST  NP«B-=a 
\P  (NPwBT  ) *NP  (■‘.P^R’')  ♦ 1 
_p 1NP«RT .MF (SPwBT ) ) »NL IS' 

CO  380  i*l .NlIST 
N» ISCRT l INAOSI  I ) . ; ) 

<,RI:E  INF^RTI  N.  ( ISCRT(  INADSI  : ) .<1  ,<<=2.N! 
380  CCNTINL'E 

;p  (ENCaTa .EQ. 1 ) GO  TO  390 
30  TO  hO 


•xERGE  the  data  S'ORED  ON  TAPEl  AND  TAPE2  '0  =oCCUCE  A SINGLE 
ORDERED  SET  OF  ALL  CHARACTER  STRINGS.  S^BSE'S  PROM  'AFEl  APE 
‘MERGED  with  those  FROM  TAPE2.  T.hE  RESULTING  -ERGED  SUBSETS  APE 
WRITTEN  AlTERNA'ELY  ON  TAFE3  AND  TaPEH  . «HEN  A,.^  SUBSE'S  PRCn 
TAPEl  AND  TAPE2  HAVE  BEEN  MERGED.  THE  PROCESS  REVERSES  AND  SUBSE'S 
PRCM  TAPE3  AND  tape**  are  merged  WI^h  The  RESUL'S  being  wRIT'EN 
ON  TAPEl  AND  TAPE2.  THIS  PROCESS  continues  UNt;l  A PINAL  OPCEBED 
LIST  IS  ACHEIVED. 


390  CONTINUE 
C BEGIN  POP  NEW  SUBSETS 

NFTEST«1 
REWIND  I 
REWIND  2 
REWIND  3 
REWIND  M 

NFP ( l ) »NFP ( 2 ) =NFP ( 3 ) -NFP ( U ) »0 

hCO  continue 

C BEGIN  POP  NEW  lists 

NCPEADl »NCPEAD2«0 
NP 1 «NP2*0 

IF  ( ICIP.NE.-l ) GO  TO  HiO 

C WILL  READ  from  units  I and  2.  wRI'E  CN  UNI'S  3 OR  H 

NFPDl »1 
NFRD2-2 

NFP I 1 l»NFP( 1 )♦! 

NFP(2) •NFPt2) ♦! 

NF'EST.-1»NFTE5T 


■ 


“jr^P’  = 3 

iSr ’ES' .El.  1 1 

*!C  ccn’’:nle 

T I ; 00  *0  -*2C  _ . 

«IlL  3EAD  ''=c^'  LNi'S  3 anC  ^ - 

Nrac.  =3 
\rPC2=‘* 

NFPl 31 =NE^( 31*1 

P(H  ) s^FO(^♦ ) » ; 

NE'"E5"  = - 1 ‘NEVES' 

NFhP’’=! 

[ f l NE  TES^  . c-0  . 4 I SE^P  -2 
■*2Q  CCN'INbE 

IE  inE(NEPC21 .E3.C)  NCPEAD2=! 

•r  (nEinEPDI 1 .EQ. 01  00  TO  520 
“►30  CONTINUE  ,,  ,-n  *r, 

;r  (NCREAOI  .EO.  : .ANC.NCREA^.a.LJ.  1 1 ^0 

[E  inOREADi  .EQ.  : . AND.NnTE5Tl  .EO.O)  4^0  ' 

READ  RRCM  NERO! 

READ  INERDII  N.  1 iCA(i<)  ,.<=2.NI 

NUTEST1=1 

ICA(N*l )=1CH; ; 

'i*e'*Nr1*EQ.  LEI  NERD!  .NFPINEROI  i 1 ' nCPEAC1  = 
IE  (NCREAD1.EQ.11  NEINERCl 1»NE(NEPCI 1-1 
IE  (NCREAD2.EQ. 1 .ANO.NuTEST2.EQ.Ol  GO  .0 
IE  INR2.EQ.01  GO  TO 
30  TO  L50 

hhQ  CONTINUE  ,,  .-n  -n 

IE  ( nCREADI .EQ . I . AND . NCREAu2 .EQ . I 1 uO  0 
IE  (NCREAD2.EQ. 1 1 GO  TO  ^80 
.BEAD  EROM  NERD2 
READ  (NER02)  M. ( ICBIKl .K=2. Ml 
NUTEST2«1 
IC9(M*l l=10H; : 

>gP2«NR2*l 

IE  (NR2.EQ.LE(NEPD2.NEP(NEPC21 1 1 NCREAlE 
IE  (NCREaD2.EQ.11  NE (NERC21 =nE (NERD21 -1 
IE  (N0READ1.EQ.11  GO  TO  500 
•♦50  CONTINUE 

COMPARE  !CA  and  ICB 

L*2 

‘♦60  continue 

IE  (ICAILI.GE  O.AND.  ICBILl  .GtL.Ol  GO 
IE  ( ICA(L1 .LT. O.AND. ICBILl .lT .Ql  GO 

IE  ( ICAILl  .GE.Ol  GO  TO  •♦80 
GO  TO  500 

470  CONTINUE  „ 

IE  ( ICAILl-lCBILl  1 •♦60. 490. 500 
400  CONTINUE 


♦00 


510 


TO 

TO  4- 


38 


IP  K' 


o r>  o o o o o o o o o 


v*Ol 


hp:*e  n.  1 icaih.)  ,k,^2,ni) 

‘.w  T 1 5 '’’ ! = Q 

GO  ^3  ‘♦3C 
->9C  CCNTISL’E 

IF  i !CA  ,i_l  .E3. 'CM;  : , anD.L.EG.-lNLIM-!  ) GO  ’3  5C3 

:.=L*1 
GO  TO  ^63 
53C  CCNTINOE 

«R!’’E  (NF^RT)  'I,  ( ICaiKl  ,K  = 2.^) 

N^J''EST2  = 0 
^ ’2 

GO  ’’0  ^^0 
513  C ON  T ! NCE 

: ""  “ERGING  OF  TWO  SUBSETS  IS  COMPLETE 

IF  (NWTESTI  .EQ.  1 ) WRITE  (NFwRTl  N .(  I CA ( K ), K = 2 . N ) 

I"  (NW’’EST2EQ.  1)  WRITE  (NFWRTl  M , ( I CB  l K ) , K=2  , M ) 

'JF  ( SF i<^R T ) *NF  ( NFWR  T ) ♦ 1 
LF (NFWPT ,NF (NF«RT ) 1 =NP1 ♦NP2 
IF  1NR2.NE.0)  NT^=NTF-1 
GO  TO  -*00 
520  continue 

IF  (NTF.EQ. 1 ) GO  TO  530 
IDIR»-I*IDIP 
GO  •'0  390 
530  continue 

C A SINGLE  ORDERED  LIST  HAS  BEEN  ACHEIVED 

DO  5H0  I’l .3.2 
IF  1NF( I) ,EQ. 1 ) NFWRT=I 

5H0  continue 

REWIND  NFWRT 


PREPARE  THE  final  ORDERED  LIST  OF  CHARACTER  STRINGS  FOP  pniN'ING 
AND  PRINT  THEM  AS  they  ARE  PROCESSED.  THE  PREPARATION  REOUlRES  A 
character  BY  CHARACTER  EXAMINATION  OF  EACH  CHARACTER  STRING  TQ 
CONVERT  COLONS  TO  BLANKS.  TO  ELIMINATE  EXCESS  BLANKS.  AND  ^3 
INSERT  - AS  A separator  BETWEEN  DATA  ITEMS. 


IF  ( IND( 1 ) .NE . 1 1 ) PRINT  710 
IPRNTSV" 1 OH 
NCOUNT«0 
LST»LF( NFWRT,  1 ) 

C PROCESS  EACH  character  STRING 

DO  690  L»1 .LST 

READ  (NFWRT)  N. ( ICDATa(K) .K=2,N) 

c initialize  IPRINT 


39 


O f > 


-AST^N 

3C  550  1=1 .last 
ippiNT ( : ) =0 
550  CONTINUE 

C SPACE  DA’'A  for  CU’’PUT  CLARl'Y 

:pos=ipcsp=ispace=o 

C EXArilNE  EACH  character  IN  !CCAya(11 

DO  5HQ  1=2. ^AST 

IF  c iOCA’Al I ) .EQ. lOH; ; ) ICDA"AI|!=0 

00  530  J=1  . 10 
ICT=I00ATA(  I 1 .AND.  IMASK(J) 

IF  ( ICT ,NE . OB)  GO  TO  580 

C CHARACTER  IS  A COLON  (SPACE) 

1SPACE=ISPACE»1 
IF  ( ISPACE.GT.2)  GO  TO  630 
IF  ( iSPACE.EQ. 1 ) GO  TO  570 
C END  OF  DA'A  I TEH.  INSER’’  - 

IPOS=!PCS*l 
IPCSP=IPOSP*l 
IF  (IPCSP.EQ.il)  IPOSP=l 
!WORO»( IPCS-1 ) / 10*1 

1 PRINT! 1W0RD)  = IPRINT( I WORD)  . OR . SHI  FT ( 1 l- . I B 1 T ( IPOSP) ) 

C IF  LISTING  BY  STUDY  BEGIN  EACH  STUDY  ON  NEw  PAGE 

IF  ( lUCRD.NE. 1 ) GO  TO  570 

IF  ( IND( 1 ) .NE. 1 1 .OR. IPR1NT( 1 ) ,EQ. IPRNTSV)  GO  TO  560 
PRINT  710 
NCOUNT=0 
560  CONTINUE 

IPRNTSV* I PR  I NT ( 1 ) 

5^0  continue 

C CONVERT  COLON  TO  A BLANK 

I C T*55555555555555555555B . AND . I MASK ( J ) 

GO  TO  590 
580  continue 

C character  not  a colon  ( space) 

ISPACE=0 
590  CONTINUE 

C make  character  part  of  character  string  to  3E  PRIN'EO  530. 

IPOS»IPOS*l 
1P0SP»1P0SP»1 
IF  (IPOSP.EQ.il)  IPOSP-1 
IUCRD«( IPOS-1 ) / 10*1 

CE'^ERMINE  NUMBER  OF  BITS  CHARACTER  IN  ICT  MUST  5E  SmI^’e;:  rcP 
storage  in  IPR|NT( IWCRD)  (620) 

IF  (IPCSP-J)  610.600.600 
600  CONTINUE 

C SHIFT  TO  LOWER  ORDER  BITS  OR  DO  NOT  SHIFT  AT  ALL 

p.  ISHIFT*IPOSP-J*l 

GO  TO  620 
610  CONTINUE 


! 


• i ' m \i  Lj  imL»  ^ 


"C  “<:C'nE=^  CPCEP  3!  ”5 
:'3r.i- '^;a-o-.PCSP*i 
:CN' : vE 

;=o:n’’i  ;«cpc)  = iPPiNT  1 :«cpc)  .CR.SniPT(  :c”.  :3!T(  :S'^:e'"  ; 

CCN" :nle 

CZ^riMt_£ 

;p  . ;pcsp.EQ. !C)  GO  to  eeo 

3LANK  fill  PEMAINCEP  OF  1 PP  ! NT  I I r<CRC  ) 

00  650  I=1PCSP, !0 

iccjiNTi !«CRC) =;pp;nti :wcrd/ .op. 1555555555555555555553. and. i^ask  : ) 
1 ) 

CCN’INUE 
CON’’  INLE 

NL :nE=1 » I UCPD-2) / 12 
:F  NC0UNT»NLINE.LE.58)  go  to  o^G 

NOT  enough  room  on  page  to  print  entire  character  spring. 

EwECT  page 

PRINT  710 
NCCUNT=0 

continue 

PRINT  character  string  HITHCUT  EJECTING  RAGE 
NC0UNT=NC0UNT»NLINE‘-1 
IF  (NLINE.EQ. 1 ) GO  TO  680 

character  string  requires  more  than  one  line  FOR  printing 

PRINT  700.  ( IPRINT(U) ,U=1 . 13) 

PRINT  720.  ! IPRINT(U) . J=IU. IWORD) 

GO  TO  590 

continue 

character  string  REQUIRES  ONLY  ONE  LINE  FOR  PRINTING 
PRINT  700.  ( IPRINTf J) .U=l . lUCRD) 

CONTINUE 

PRINT  730.  LISTTOT 

catalog  listing  completed.  CHECK  FCR  ADDITIONAL  LISTINGS 
GO  TO  20 


format  (/1X.I3AI0) 
format  (IHl//) 

FORMAT  (1IX.12A10) 

FORMAT  ( / / /20X. I5.28H  ENTRIES  APPEAR  ON  THIS  LIST) 
END 


SUBROUTINE  INPUT 
COMMON  /A/  IND(20) .NITEM 
DIMENSION  NAME! 10) 

DATA  IRETURN/1/ 

DATA  NAME / >4H I TEM . 7HENDCASE  , 6HEND JOB/ 

C 

PRINT  80 
NI TEM*0 
10  CONTINUE 

READ  50.  I NAME. ILOC.X 
PRINT  60.  I NAME. ILOC.X 
DO  20  1*1.10 

If  ( INAME.EQ.NAMEI I ) ) GO  TO  30 
20  CONTINUE 
IRETURN-0 
PRINT  70.  INAME 
30  CONTINUE 

IF  ( INAME.NE.NAMEI 1 ) ) GO  TO  HO 
!NO( IL0C)»X 
NITEM-NITEMfl 
HO  CONTINUE 

IF  ( INAME.EQ.NAME12) .AND. IRETURN.EQ. 1 ) RETURN 
IF  ( INAME.EQ.NAME13) ) STOP  1 
GO  TO  10 
C 

c 

50  format  (A10.I5.F10.0) 

60  FORMAT  (1X.A10.I5.F10.1) 

70  format  (5H  •••.1X.A10.5H .21H  IS  NOT  IN  DICTIONARY) 

80  format  ( 1H1 ) 

END 


42 


